Dataset statistics
| Number of variables | 28 |
|---|---|
| Number of observations | 150000 |
| Missing cells | 140124 |
| Missing cells (%) | 3.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 32.0 MiB |
| Average record size in memory | 224.0 B |
Variable types
| Categorical | 19 |
|---|---|
| Numeric | 8 |
| Unsupported | 1 |
ID has a high cardinality: 150000 distinct values | High cardinality |
Customer_ID has a high cardinality: 12500 distinct values | High cardinality |
Name has a high cardinality: 10139 distinct values | High cardinality |
Age has a high cardinality: 2524 distinct values | High cardinality |
SSN has a high cardinality: 12501 distinct values | High cardinality |
Annual_Income has a high cardinality: 21192 distinct values | High cardinality |
Num_of_Loan has a high cardinality: 623 distinct values | High cardinality |
Type_of_Loan has a high cardinality: 6260 distinct values | High cardinality |
Num_of_Delayed_Payment has a high cardinality: 1058 distinct values | High cardinality |
Changed_Credit_Limit has a high cardinality: 4605 distinct values | High cardinality |
Outstanding_Debt has a high cardinality: 13622 distinct values | High cardinality |
Credit_History_Age has a high cardinality: 408 distinct values | High cardinality |
Amount_invested_monthly has a high cardinality: 136497 distinct values | High cardinality |
Num_Bank_Accounts is highly overall correlated with Interest_Rate and 1 other fields | High correlation |
Interest_Rate is highly overall correlated with Num_Bank_Accounts and 2 other fields | High correlation |
Delay_from_due_date is highly overall correlated with Num_Bank_Accounts and 1 other fields | High correlation |
Num_Credit_Inquiries is highly overall correlated with Interest_Rate | High correlation |
Num_of_Loan is highly imbalanced (60.9%) | Imbalance |
Num_of_Delayed_Payment is highly imbalanced (50.9%) | Imbalance |
Name has 15000 (10.0%) missing values | Missing |
Monthly_Inhand_Salary has 22500 (15.0%) missing values | Missing |
Type_of_Loan has 17112 (11.4%) missing values | Missing |
Num_of_Delayed_Payment has 10500 (7.0%) missing values | Missing |
Num_Credit_Inquiries has 3000 (2.0%) missing values | Missing |
Credit_History_Age has 13500 (9.0%) missing values | Missing |
Amount_invested_monthly has 6750 (4.5%) missing values | Missing |
Monthly_Balance has 1762 (1.2%) missing values | Missing |
Credit_Score has 50000 (33.3%) missing values | Missing |
ID is uniformly distributed | Uniform |
Customer_ID is uniformly distributed | Uniform |
Month is uniformly distributed | Uniform |
ID has unique values | Unique |
Credit_Utilization_Ratio has unique values | Unique |
Monthly_Balance is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Num_Bank_Accounts has 6494 (4.3%) zeros | Zeros |
Delay_from_due_date has 1821 (1.2%) zeros | Zeros |
Num_Credit_Inquiries has 8074 (5.4%) zeros | Zeros |
Total_EMI_per_month has 15615 (10.4%) zeros | Zeros |
Reproduction
| Analysis started | 2023-04-06 12:09:22.845305 |
|---|---|
| Analysis finished | 2023-04-06 12:09:57.509888 |
| Duration | 34.66 seconds |
| Software version | ydata-profiling vv4.1.1 |
| Download configuration | config.json |
ID
Categorical
HIGH CARDINALITY  UNIFORM  UNIQUE 
| Distinct | 150000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 0x160a | 1 |
|---|---|
| 0x13b0a | 1 |
| 0x13af2 | 1 |
| 0x13af3 | 1 |
| 0x13af4 | 1 |
| Other values (149995) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.6006533 |
| Min length | 6 |
Characters and Unicode
| Total characters | 990098 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 150000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 0x160a |
|---|---|
| 2nd row | 0x160b |
| 3rd row | 0x160c |
| 4th row | 0x160d |
| 5th row | 0x1616 |
Common Values
| Value | Count | Frequency (%) |
| 0x160a | 1 | < 0.1% |
| 0x13b0a | 1 | < 0.1% |
| 0x13af2 | 1 | < 0.1% |
| 0x13af3 | 1 | < 0.1% |
| 0x13af4 | 1 | < 0.1% |
| 0x13af5 | 1 | < 0.1% |
| 0x13afa | 1 | < 0.1% |
| 0x13afb | 1 | < 0.1% |
| 0x13afc | 1 | < 0.1% |
| 0x13afd | 1 | < 0.1% |
| Other values (149990) | 149990 |
Length
| Value | Count | Frequency (%) |
| 0x160a | 1 | < 0.1% |
| 0x1631 | 1 | < 0.1% |
| 0x163c | 1 | < 0.1% |
| 0x163b | 1 | < 0.1% |
| 0x1622 | 1 | < 0.1% |
| 0x160c | 1 | < 0.1% |
| 0x160d | 1 | < 0.1% |
| 0x1616 | 1 | < 0.1% |
| 0x1617 | 1 | < 0.1% |
| 0x1618 | 1 | < 0.1% |
| Other values (149990) | 149990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 186157 | |
| x | 150000 | |
| 1 | 104253 | |
| 2 | 64817 | 6.5% |
| 3 | 40255 | 4.1% |
| 4 | 40255 | 4.1% |
| 5 | 40241 | 4.1% |
| a | 36415 | 3.7% |
| b | 36415 | 3.7% |
| c | 36415 | 3.7% |
| Other values (7) | 254875 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 621636 | |
| Lowercase Letter | 368462 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 186157 | |
| 1 | 104253 | |
| 2 | 64817 | 10.4% |
| 3 | 40255 | 6.5% |
| 4 | 40255 | 6.5% |
| 5 | 40241 | 6.5% |
| 7 | 36415 | 5.9% |
| 8 | 36415 | 5.9% |
| 9 | 36415 | 5.9% |
| 6 | 36413 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 150000 | |
| a | 36415 | 9.9% |
| b | 36415 | 9.9% |
| c | 36415 | 9.9% |
| d | 36415 | 9.9% |
| e | 36415 | 9.9% |
| f | 36387 | 9.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 621636 | |
| Latin | 368462 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 186157 | |
| 1 | 104253 | |
| 2 | 64817 | 10.4% |
| 3 | 40255 | 6.5% |
| 4 | 40255 | 6.5% |
| 5 | 40241 | 6.5% |
| 7 | 36415 | 5.9% |
| 8 | 36415 | 5.9% |
| 9 | 36415 | 5.9% |
| 6 | 36413 | 5.9% |
Latin
| Value | Count | Frequency (%) |
| x | 150000 | |
| a | 36415 | 9.9% |
| b | 36415 | 9.9% |
| c | 36415 | 9.9% |
| d | 36415 | 9.9% |
| e | 36415 | 9.9% |
| f | 36387 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 990098 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 186157 | |
| x | 150000 | |
| 1 | 104253 | |
| 2 | 64817 | 6.5% |
| 3 | 40255 | 4.1% |
| 4 | 40255 | 4.1% |
| 5 | 40241 | 4.1% |
| a | 36415 | 3.7% |
| b | 36415 | 3.7% |
| c | 36415 | 3.7% |
| Other values (7) | 254875 |
Customer_ID
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 12500 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| CUS_0xd40 | 12 |
|---|---|
| CUS_0x9bf4 | 12 |
| CUS_0x5ae3 | 12 |
| CUS_0xbe9a | 12 |
| CUS_0x4874 | 12 |
| Other values (12495) |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.93952 |
| Min length | 9 |
Characters and Unicode
| Total characters | 1490928 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CUS_0xd40 |
|---|---|
| 2nd row | CUS_0xd40 |
| 3rd row | CUS_0xd40 |
| 4th row | CUS_0xd40 |
| 5th row | CUS_0x21b1 |
Common Values
| Value | Count | Frequency (%) |
| CUS_0xd40 | 12 | < 0.1% |
| CUS_0x9bf4 | 12 | < 0.1% |
| CUS_0x5ae3 | 12 | < 0.1% |
| CUS_0xbe9a | 12 | < 0.1% |
| CUS_0x4874 | 12 | < 0.1% |
| CUS_0xc67b | 12 | < 0.1% |
| CUS_0x8a64 | 12 | < 0.1% |
| CUS_0x35ea | 12 | < 0.1% |
| CUS_0x5044 | 12 | < 0.1% |
| CUS_0x9dfd | 12 | < 0.1% |
| Other values (12490) | 149880 |
Length
| Value | Count | Frequency (%) |
| cus_0xd40 | 12 | < 0.1% |
| cus_0x75c6 | 12 | < 0.1% |
| cus_0x5b48 | 12 | < 0.1% |
| cus_0xc0ab | 12 | < 0.1% |
| cus_0x2dbc | 12 | < 0.1% |
| cus_0xb891 | 12 | < 0.1% |
| cus_0x1cdb | 12 | < 0.1% |
| cus_0x95ee | 12 | < 0.1% |
| cus_0x284a | 12 | < 0.1% |
| cus_0x5407 | 12 | < 0.1% |
| Other values (12490) | 149880 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 177372 | |
| C | 150000 | 10.1% |
| S | 150000 | 10.1% |
| _ | 150000 | 10.1% |
| x | 150000 | 10.1% |
| U | 150000 | 10.1% |
| 4 | 42000 | 2.8% |
| 6 | 41100 | 2.8% |
| 5 | 40800 | 2.7% |
| 3 | 40608 | 2.7% |
| Other values (11) | 399048 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 541812 | |
| Uppercase Letter | 450000 | |
| Lowercase Letter | 349116 | |
| Connector Punctuation | 150000 | 10.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 177372 | |
| 4 | 42000 | 7.8% |
| 6 | 41100 | 7.6% |
| 5 | 40800 | 7.5% |
| 3 | 40608 | 7.5% |
| 8 | 40608 | 7.5% |
| 7 | 40164 | 7.4% |
| 9 | 40104 | 7.4% |
| 2 | 40080 | 7.4% |
| 1 | 38976 | 7.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 150000 | |
| b | 40200 | 11.5% |
| a | 39816 | 11.4% |
| c | 33408 | 9.6% |
| e | 29232 | 8.4% |
| d | 28308 | 8.1% |
| f | 28152 | 8.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 150000 | |
| S | 150000 | |
| U | 150000 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 150000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 799116 | |
| Common | 691812 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 177372 | |
| _ | 150000 | |
| 4 | 42000 | 6.1% |
| 6 | 41100 | 5.9% |
| 5 | 40800 | 5.9% |
| 3 | 40608 | 5.9% |
| 8 | 40608 | 5.9% |
| 7 | 40164 | 5.8% |
| 9 | 40104 | 5.8% |
| 2 | 40080 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| C | 150000 | |
| S | 150000 | |
| x | 150000 | |
| U | 150000 | |
| b | 40200 | 5.0% |
| a | 39816 | 5.0% |
| c | 33408 | 4.2% |
| e | 29232 | 3.7% |
| d | 28308 | 3.5% |
| f | 28152 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1490928 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 177372 | |
| C | 150000 | 10.1% |
| S | 150000 | 10.1% |
| _ | 150000 | 10.1% |
| x | 150000 | 10.1% |
| U | 150000 | 10.1% |
| 4 | 42000 | 2.8% |
| 6 | 41100 | 2.8% |
| 5 | 40800 | 2.7% |
| 3 | 40608 | 2.7% |
| Other values (11) | 399048 |
Month
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| September | |
|---|---|
| October | |
| November | |
| December | |
| January | |
| Other values (7) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.1666667 |
| Min length | 3 |
Characters and Unicode
| Total characters | 925000 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | September |
|---|---|
| 2nd row | October |
| 3rd row | November |
| 4th row | December |
| 5th row | September |
Common Values
| Value | Count | Frequency (%) |
| September | 12500 | |
| October | 12500 | |
| November | 12500 | |
| December | 12500 | |
| January | 12500 | |
| February | 12500 | |
| March | 12500 | |
| April | 12500 | |
| May | 12500 | |
| June | 12500 | |
| Other values (2) | 25000 |
Length
| Value | Count | Frequency (%) |
| september | 12500 | |
| october | 12500 | |
| november | 12500 | |
| december | 12500 | |
| january | 12500 | |
| february | 12500 | |
| march | 12500 | |
| april | 12500 | |
| may | 12500 | |
| june | 12500 | |
| Other values (2) | 25000 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 137500 | |
| r | 112500 | |
| u | 75000 | 8.1% |
| b | 62500 | 6.8% |
| a | 62500 | 6.8% |
| y | 50000 | 5.4% |
| J | 37500 | 4.1% |
| t | 37500 | 4.1% |
| m | 37500 | 4.1% |
| c | 37500 | 4.1% |
| Other values (16) | 275000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 775000 | |
| Uppercase Letter | 150000 | 16.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 137500 | |
| r | 112500 | |
| u | 75000 | |
| b | 62500 | |
| a | 62500 | |
| y | 50000 | 6.5% |
| t | 37500 | 4.8% |
| m | 37500 | 4.8% |
| c | 37500 | 4.8% |
| n | 25000 | 3.2% |
| Other values (8) | 137500 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 37500 | |
| A | 25000 | |
| M | 25000 | |
| S | 12500 | 8.3% |
| F | 12500 | 8.3% |
| D | 12500 | 8.3% |
| N | 12500 | 8.3% |
| O | 12500 | 8.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 925000 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 137500 | |
| r | 112500 | |
| u | 75000 | 8.1% |
| b | 62500 | 6.8% |
| a | 62500 | 6.8% |
| y | 50000 | 5.4% |
| J | 37500 | 4.1% |
| t | 37500 | 4.1% |
| m | 37500 | 4.1% |
| c | 37500 | 4.1% |
| Other values (16) | 275000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 925000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 137500 | |
| r | 112500 | |
| u | 75000 | 8.1% |
| b | 62500 | 6.8% |
| a | 62500 | 6.8% |
| y | 50000 | 5.4% |
| J | 37500 | 4.1% |
| t | 37500 | 4.1% |
| m | 37500 | 4.1% |
| c | 37500 | 4.1% |
| Other values (16) | 275000 |
Name
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 10139 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 15000 |
| Missing (%) | 10.0% |
| Memory size | 1.1 MiB |
| Stevex | 66 |
|---|---|
| Langep | 65 |
| Jessicad | 59 |
| Vaughanl | 58 |
| Raymondr | 58 |
| Other values (10134) |
Length
| Max length | 25 |
|---|---|
| Median length | 20 |
| Mean length | 9.7659926 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1318409 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aaron Maashoh |
|---|---|
| 2nd row | Aaron Maashoh |
| 3rd row | Aaron Maashoh |
| 4th row | Aaron Maashoh |
| 5th row | Rick Rothackerj |
Common Values
| Value | Count | Frequency (%) |
| Stevex | 66 | < 0.1% |
| Langep | 65 | < 0.1% |
| Jessicad | 59 | < 0.1% |
| Vaughanl | 58 | < 0.1% |
| Raymondr | 58 | < 0.1% |
| Deepa Seetharamanm | 58 | < 0.1% |
| Nicko | 57 | < 0.1% |
| Jessica Wohlt | 57 | < 0.1% |
| Ronald Groverk | 56 | < 0.1% |
| Danielz | 55 | < 0.1% |
| Other values (10129) | 134411 | |
| (Missing) | 15000 | 10.0% |
Length
| Value | Count | Frequency (%) |
| david | 968 | 0.5% |
| jonathan | 913 | 0.5% |
| jessica | 761 | 0.4% |
| sarah | 617 | 0.3% |
| karen | 568 | 0.3% |
| nick | 561 | 0.3% |
| tim | 555 | 0.3% |
| caroline | 553 | 0.3% |
| john | 511 | 0.3% |
| tom | 508 | 0.3% |
| Other values (9720) | 181957 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 137614 | 10.4% |
| e | 114139 | 8.7% |
| n | 88845 | 6.7% |
| i | 87745 | 6.7% |
| r | 81658 | 6.2% |
| o | 66930 | 5.1% |
| l | 63199 | 4.8% |
| 53523 | 4.1% | |
| t | 52504 | 4.0% |
| h | 45818 | 3.5% |
| Other values (47) | 526434 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1071504 | |
| Uppercase Letter | 187988 | 14.3% |
| Space Separator | 53523 | 4.1% |
| Other Punctuation | 3222 | 0.2% |
| Dash Punctuation | 2172 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 137614 | |
| e | 114139 | 10.7% |
| n | 88845 | 8.3% |
| i | 87745 | 8.2% |
| r | 81658 | 7.6% |
| o | 66930 | 6.2% |
| l | 63199 | 5.9% |
| t | 52504 | 4.9% |
| h | 45818 | 4.3% |
| s | 45628 | 4.3% |
| Other values (16) | 287424 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 21401 | 11.4% |
| A | 13103 | 7.0% |
| M | 12985 | 6.9% |
| L | 12584 | 6.7% |
| J | 11986 | 6.4% |
| C | 11670 | 6.2% |
| R | 10719 | 5.7% |
| D | 10512 | 5.6% |
| K | 10341 | 5.5% |
| B | 9782 | 5.2% |
| Other values (16) | 62905 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1649 | |
| " | 1444 | |
| , | 129 | 4.0% |
Space Separator
| Value | Count | Frequency (%) |
| 53523 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2172 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1259492 | |
| Common | 58917 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 137614 | 10.9% |
| e | 114139 | 9.1% |
| n | 88845 | 7.1% |
| i | 87745 | 7.0% |
| r | 81658 | 6.5% |
| o | 66930 | 5.3% |
| l | 63199 | 5.0% |
| t | 52504 | 4.2% |
| h | 45818 | 3.6% |
| s | 45628 | 3.6% |
| Other values (42) | 475412 |
Common
| Value | Count | Frequency (%) |
| 53523 | ||
| - | 2172 | 3.7% |
| . | 1649 | 2.8% |
| " | 1444 | 2.5% |
| , | 129 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1318409 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 137614 | 10.4% |
| e | 114139 | 8.7% |
| n | 88845 | 6.7% |
| i | 87745 | 6.7% |
| r | 81658 | 6.2% |
| o | 66930 | 5.1% |
| l | 63199 | 4.8% |
| 53523 | 4.1% | |
| t | 52504 | 4.0% |
| h | 45818 | 3.5% |
| Other values (47) | 526434 |
Age
Categorical
| Distinct | 2524 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 39 | 4198 |
|---|---|
| 32 | 4189 |
| 28 | 4173 |
| 26 | 4140 |
| 35 | 4130 |
| Other values (2519) |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.1030733 |
| Min length | 2 |
Characters and Unicode
| Total characters | 315461 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2084 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 23 |
|---|---|
| 2nd row | 24 |
| 3rd row | 24 |
| 4th row | 24_ |
| 5th row | 28 |
Common Values
| Value | Count | Frequency (%) |
| 39 | 4198 | 2.8% |
| 32 | 4189 | 2.8% |
| 28 | 4173 | 2.8% |
| 26 | 4140 | 2.8% |
| 35 | 4130 | 2.8% |
| 44 | 4116 | 2.7% |
| 38 | 4099 | 2.7% |
| 27 | 4089 | 2.7% |
| 31 | 4071 | 2.7% |
| 22 | 4063 | 2.7% |
| Other values (2514) | 108732 |
Length
| Value | Count | Frequency (%) |
| 39 | 4416 | 2.9% |
| 32 | 4413 | 2.9% |
| 28 | 4383 | 2.9% |
| 26 | 4366 | 2.9% |
| 35 | 4349 | 2.9% |
| 38 | 4334 | 2.9% |
| 44 | 4324 | 2.9% |
| 27 | 4316 | 2.9% |
| 31 | 4287 | 2.9% |
| 22 | 4278 | 2.9% |
| Other values (2434) | 106534 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 57829 | |
| 3 | 57493 | |
| 4 | 50047 | |
| 5 | 32300 | |
| 1 | 31218 | |
| 0 | 17664 | 5.6% |
| 6 | 15759 | 5.0% |
| 9 | 15624 | 5.0% |
| 8 | 14972 | 4.7% |
| 7 | 13789 | 4.4% |
| Other values (2) | 8766 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 306695 | |
| Connector Punctuation | 7416 | 2.4% |
| Dash Punctuation | 1350 | 0.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 57829 | |
| 3 | 57493 | |
| 4 | 50047 | |
| 5 | 32300 | |
| 1 | 31218 | |
| 0 | 17664 | 5.8% |
| 6 | 15759 | 5.1% |
| 9 | 15624 | 5.1% |
| 8 | 14972 | 4.9% |
| 7 | 13789 | 4.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7416 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1350 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 315461 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 57829 | |
| 3 | 57493 | |
| 4 | 50047 | |
| 5 | 32300 | |
| 1 | 31218 | |
| 0 | 17664 | 5.6% |
| 6 | 15759 | 5.0% |
| 9 | 15624 | 5.0% |
| 8 | 14972 | 4.7% |
| 7 | 13789 | 4.4% |
| Other values (2) | 8766 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 315461 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 57829 | |
| 3 | 57493 | |
| 4 | 50047 | |
| 5 | 32300 | |
| 1 | 31218 | |
| 0 | 17664 | 5.6% |
| 6 | 15759 | 5.0% |
| 9 | 15624 | 5.0% |
| 8 | 14972 | 4.7% |
| 7 | 13789 | 4.4% |
| Other values (2) | 8766 | 2.8% |
SSN
Categorical
| Distinct | 12501 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| #F%$D@*&8 | 8400 |
|---|---|
| 078-73-5990 | 12 |
| 374-03-0670 | 12 |
| 255-39-8777 | 12 |
| 866-11-3352 | 12 |
| Other values (12496) |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.888 |
| Min length | 9 |
Characters and Unicode
| Total characters | 1633200 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 821-00-0265 |
|---|---|
| 2nd row | 821-00-0265 |
| 3rd row | 821-00-0265 |
| 4th row | 821-00-0265 |
| 5th row | 004-07-5839 |
Common Values
| Value | Count | Frequency (%) |
| #F%$D@*&8 | 8400 | 5.6% |
| 078-73-5990 | 12 | < 0.1% |
| 374-03-0670 | 12 | < 0.1% |
| 255-39-8777 | 12 | < 0.1% |
| 866-11-3352 | 12 | < 0.1% |
| 159-51-7992 | 12 | < 0.1% |
| 318-30-9160 | 12 | < 0.1% |
| 259-11-0934 | 12 | < 0.1% |
| 162-17-7776 | 12 | < 0.1% |
| 557-07-3973 | 12 | < 0.1% |
| Other values (12491) | 141492 |
Length
| Value | Count | Frequency (%) |
| f%$d@*&8 | 8400 | 5.6% |
| 741-04-8469 | 12 | < 0.1% |
| 213-49-2021 | 12 | < 0.1% |
| 996-52-9835 | 12 | < 0.1% |
| 978-19-7269 | 12 | < 0.1% |
| 056-57-6013 | 12 | < 0.1% |
| 302-82-0750 | 12 | < 0.1% |
| 174-73-2790 | 12 | < 0.1% |
| 159-72-2454 | 12 | < 0.1% |
| 820-41-6857 | 12 | < 0.1% |
| Other values (12491) | 141492 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 283200 | |
| 8 | 137348 | |
| 1 | 129790 | |
| 4 | 128788 | |
| 2 | 127869 | |
| 7 | 127708 | |
| 0 | 127016 | |
| 9 | 126945 | |
| 5 | 126837 | |
| 3 | 125445 | |
| Other values (9) | 192254 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1282800 | |
| Dash Punctuation | 283200 | 17.3% |
| Other Punctuation | 42000 | 2.6% |
| Uppercase Letter | 16800 | 1.0% |
| Currency Symbol | 8400 | 0.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 137348 | |
| 1 | 129790 | |
| 4 | 128788 | |
| 2 | 127869 | |
| 7 | 127708 | |
| 0 | 127016 | |
| 9 | 126945 | |
| 5 | 126837 | |
| 3 | 125445 | |
| 6 | 125054 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 8400 | |
| * | 8400 | |
| @ | 8400 | |
| % | 8400 | |
| # | 8400 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 8400 | |
| D | 8400 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 283200 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 8400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1616400 | |
| Latin | 16800 | 1.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 283200 | |
| 8 | 137348 | |
| 1 | 129790 | |
| 4 | 128788 | |
| 2 | 127869 | |
| 7 | 127708 | |
| 0 | 127016 | |
| 9 | 126945 | |
| 5 | 126837 | |
| 3 | 125445 | |
| Other values (7) | 175454 |
Latin
| Value | Count | Frequency (%) |
| F | 8400 | |
| D | 8400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1633200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 283200 | |
| 8 | 137348 | |
| 1 | 129790 | |
| 4 | 128788 | |
| 2 | 127869 | |
| 7 | 127708 | |
| 0 | 127016 | |
| 9 | 126945 | |
| 5 | 126837 | |
| 3 | 125445 | |
| Other values (9) | 192254 |
Occupation
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| _______ | |
|---|---|
| Lawyer | 9899 |
| Engineer | 9562 |
| Architect | 9550 |
| Mechanic | 9459 |
| Other values (11) |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.43234 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1264851 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Scientist |
|---|---|
| 2nd row | Scientist |
| 3rd row | Scientist |
| 4th row | Scientist |
| 5th row | _______ |
Common Values
| Value | Count | Frequency (%) |
| _______ | 10500 | 7.0% |
| Lawyer | 9899 | 6.6% |
| Engineer | 9562 | 6.4% |
| Architect | 9550 | 6.4% |
| Mechanic | 9459 | 6.3% |
| Accountant | 9404 | 6.3% |
| Scientist | 9403 | 6.3% |
| Developer | 9381 | 6.3% |
| Media_Manager | 9362 | 6.2% |
| Teacher | 9318 | 6.2% |
| Other values (6) | 54162 |
Length
| Value | Count | Frequency (%) |
| 10500 | 7.0% | |
| lawyer | 9899 | 6.6% |
| engineer | 9562 | 6.4% |
| architect | 9550 | 6.4% |
| mechanic | 9459 | 6.3% |
| accountant | 9404 | 6.3% |
| scientist | 9403 | 6.3% |
| developer | 9381 | 6.3% |
| media_manager | 9362 | 6.2% |
| teacher | 9318 | 6.2% |
| Other values (6) | 54162 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 168560 | |
| r | 129748 | |
| n | 111663 | 8.8% |
| a | 102092 | 8.1% |
| c | 93519 | 7.4% |
| t | 93045 | 7.4% |
| i | 92395 | 7.3% |
| _ | 82862 | 6.6% |
| o | 46135 | 3.6% |
| M | 46014 | 3.6% |
| Other values (18) | 298818 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1033127 | |
| Uppercase Letter | 148862 | 11.8% |
| Connector Punctuation | 82862 | 6.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 168560 | |
| r | 129748 | |
| n | 111663 | |
| a | 102092 | |
| c | 93519 | |
| t | 93045 | |
| i | 92395 | |
| o | 46135 | 4.5% |
| u | 36661 | 3.5% |
| h | 28327 | 2.7% |
| Other values (8) | 130982 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 46014 | |
| A | 18954 | |
| E | 18839 | |
| D | 18495 | |
| L | 9899 | 6.6% |
| S | 9403 | 6.3% |
| T | 9318 | 6.3% |
| J | 9122 | 6.1% |
| W | 8818 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 82862 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1181989 | |
| Common | 82862 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 168560 | |
| r | 129748 | |
| n | 111663 | |
| a | 102092 | 8.6% |
| c | 93519 | 7.9% |
| t | 93045 | 7.9% |
| i | 92395 | 7.8% |
| o | 46135 | 3.9% |
| M | 46014 | 3.9% |
| u | 36661 | 3.1% |
| Other values (17) | 262157 |
Common
| Value | Count | Frequency (%) |
| _ | 82862 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1264851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 168560 | |
| r | 129748 | |
| n | 111663 | 8.8% |
| a | 102092 | 8.1% |
| c | 93519 | 7.4% |
| t | 93045 | 7.4% |
| i | 92395 | 7.3% |
| _ | 82862 | 6.6% |
| o | 46135 | 3.6% |
| M | 46014 | 3.6% |
| Other values (18) | 298818 |
Annual_Income
Categorical
| Distinct | 21192 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 36585.12 | 24 |
|---|---|
| 9141.63 | 23 |
| 95596.35 | 23 |
| 20867.67 | 23 |
| 17816.75 | 23 |
| Other values (21187) |
Length
| Max length | 19 |
|---|---|
| Median length | 8 |
| Mean length | 8.3092267 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1246384 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6190 ? |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | 19114.12 |
|---|---|
| 2nd row | 19114.12 |
| 3rd row | 19114.12 |
| 4th row | 19114.12 |
| 5th row | 34847.84 |
Common Values
| Value | Count | Frequency (%) |
| 36585.12 | 24 | < 0.1% |
| 9141.63 | 23 | < 0.1% |
| 95596.35 | 23 | < 0.1% |
| 20867.67 | 23 | < 0.1% |
| 17816.75 | 23 | < 0.1% |
| 33029.66 | 22 | < 0.1% |
| 72524.2 | 22 | < 0.1% |
| 109945.32 | 22 | < 0.1% |
| 17273.83 | 22 | < 0.1% |
| 22434.16 | 21 | < 0.1% |
| Other values (21182) | 149775 |
Length
| Value | Count | Frequency (%) |
| 36585.12 | 24 | < 0.1% |
| 109945.32 | 24 | < 0.1% |
| 9141.63 | 24 | < 0.1% |
| 32543.38 | 24 | < 0.1% |
| 22434.16 | 24 | < 0.1% |
| 17273.83 | 24 | < 0.1% |
| 40341.16 | 24 | < 0.1% |
| 17816.75 | 24 | < 0.1% |
| 20867.67 | 24 | < 0.1% |
| 72524.2 | 23 | < 0.1% |
| Other values (13978) | 149761 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 150000 | |
| 1 | 139224 | |
| 2 | 114510 | |
| 4 | 107657 | |
| 3 | 107443 | |
| 8 | 106304 | |
| 5 | 106050 | |
| 6 | 105985 | |
| 9 | 103560 | |
| 0 | 99513 | |
| Other values (2) | 106138 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1085884 | |
| Other Punctuation | 150000 | 12.0% |
| Connector Punctuation | 10500 | 0.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 139224 | |
| 2 | 114510 | |
| 4 | 107657 | |
| 3 | 107443 | |
| 8 | 106304 | |
| 5 | 106050 | |
| 6 | 105985 | |
| 9 | 103560 | |
| 0 | 99513 | |
| 7 | 95638 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 150000 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 10500 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1246384 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 150000 | |
| 1 | 139224 | |
| 2 | 114510 | |
| 4 | 107657 | |
| 3 | 107443 | |
| 8 | 106304 | |
| 5 | 106050 | |
| 6 | 105985 | |
| 9 | 103560 | |
| 0 | 99513 | |
| Other values (2) | 106138 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1246384 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 150000 | |
| 1 | 139224 | |
| 2 | 114510 | |
| 4 | 107657 | |
| 3 | 107443 | |
| 8 | 106304 | |
| 5 | 106050 | |
| 6 | 105985 | |
| 9 | 103560 | |
| 0 | 99513 | |
| Other values (2) | 106138 |
Monthly_Inhand_Salary
Real number (ℝ)
| Distinct | 13683 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 22500 |
| Missing (%) | 15.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4190.1151 |
| Minimum | 303.64542 |
|---|---|
| Maximum | 15204.633 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 303.64542 |
|---|---|
| 5-th percentile | 835.80904 |
| Q1 | 1625.2658 |
| median | 3091 |
| Q3 | 5948.4546 |
| 95-th percentile | 10812.433 |
| Maximum | 15204.633 |
| Range | 14900.988 |
| Interquartile range (IQR) | 4323.1888 |
Descriptive statistics
| Standard deviation | 3180.4897 |
|---|---|
| Coefficient of variation (CV) | 0.75904589 |
| Kurtosis | 0.61802104 |
| Mean | 4190.1151 |
| Median Absolute Deviation (MAD) | 1754.03 |
| Skewness | 1.1286314 |
| Sum | 5.3423968 × 108 |
| Variance | 10115514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2295.058333 | 22 | < 0.1% |
| 6082.1875 | 22 | < 0.1% |
| 3080.555 | 21 | < 0.1% |
| 6358.956667 | 21 | < 0.1% |
| 6639.56 | 20 | < 0.1% |
| 4387.2725 | 20 | < 0.1% |
| 5766.491667 | 20 | < 0.1% |
| 1315.560833 | 19 | < 0.1% |
| 6769.13 | 19 | < 0.1% |
| 536.43125 | 19 | < 0.1% |
| Other values (13673) | 127297 | |
| (Missing) | 22500 | 15.0% |
| Value | Count | Frequency (%) |
| 303.6454167 | 10 | |
| 319.55625 | 11 | |
| 331.0319233 | 2 | < 0.1% |
| 332.1283333 | 10 | |
| 332.43125 | 10 | |
| 333.5966667 | 10 | |
| 355.2083333 | 12 | |
| 357.2558333 | 11 | |
| 358.0583333 | 10 | |
| 361.6033333 | 10 |
| Value | Count | Frequency (%) |
| 15204.63333 | 10 | |
| 15167.18 | 12 | |
| 15136.69667 | 10 | |
| 15115.19 | 10 | |
| 15101.94 | 11 | |
| 15091.08667 | 5 | |
| 15090.07667 | 11 | |
| 15066.78333 | 11 | |
| 15038.31667 | 5 | |
| 14978.33667 | 10 |
Num_Bank_Accounts
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 1183 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.00694 |
| Minimum | -1 |
|---|---|
| Maximum | 1798 |
| Zeros | 6494 |
| Zeros (%) | 4.3% |
| Negative | 37 |
| Negative (%) | < 0.1% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 6 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 1798 |
| Range | 1799 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 117.06948 |
|---|---|
| Coefficient of variation (CV) | 6.8836296 |
| Kurtosis | 132.64797 |
| Mean | 17.00694 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 11.218773 |
| Sum | 2551041 |
| Variance | 13705.262 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 19505 | |
| 7 | 19231 | |
| 8 | 19152 | |
| 4 | 18286 | |
| 5 | 18186 | |
| 3 | 17905 | |
| 9 | 8181 | |
| 10 | 7846 | |
| 1 | 6743 | 4.5% |
| 0 | 6494 | 4.3% |
| Other values (1173) | 8471 |
| Value | Count | Frequency (%) |
| -1 | 37 | < 0.1% |
| 0 | 6494 | 4.3% |
| 1 | 6743 | 4.5% |
| 2 | 6456 | 4.3% |
| 3 | 17905 | |
| 4 | 18286 | |
| 5 | 18186 | |
| 6 | 19505 | |
| 7 | 19231 | |
| 8 | 19152 |
| Value | Count | Frequency (%) |
| 1798 | 3 | |
| 1794 | 2 | |
| 1793 | 1 | < 0.1% |
| 1789 | 2 | |
| 1786 | 1 | < 0.1% |
| 1784 | 2 | |
| 1783 | 2 | |
| 1782 | 1 | < 0.1% |
| 1781 | 1 | < 0.1% |
| 1780 | 1 | < 0.1% |
Num_Credit_Card
Real number (ℝ)
| Distinct | 1344 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.623447 |
| Minimum | 0 |
|---|---|
| Maximum | 1499 |
| Zeros | 29 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 10 |
| Maximum | 1499 |
| Range | 1499 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 129.14301 |
|---|---|
| Coefficient of variation (CV) | 5.7083701 |
| Kurtosis | 73.645489 |
| Mean | 22.623447 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 8.4006471 |
| Sum | 3393517 |
| Variance | 16677.916 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 27669 | |
| 7 | 24886 | |
| 6 | 24802 | |
| 4 | 21102 | |
| 3 | 19816 | |
| 8 | 7453 | 5.0% |
| 10 | 7265 | 4.8% |
| 9 | 6976 | 4.7% |
| 2 | 3280 | 2.2% |
| 1 | 3195 | 2.1% |
| Other values (1334) | 3556 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 29 | < 0.1% |
| 1 | 3195 | 2.1% |
| 2 | 3280 | 2.2% |
| 3 | 19816 | |
| 4 | 21102 | |
| 5 | 27669 | |
| 6 | 24802 | |
| 7 | 24886 | |
| 8 | 7453 | 5.0% |
| 9 | 6976 | 4.7% |
| Value | Count | Frequency (%) |
| 1499 | 3 | |
| 1498 | 5 | |
| 1497 | 3 | |
| 1496 | 2 | < 0.1% |
| 1495 | 2 | < 0.1% |
| 1494 | 1 | < 0.1% |
| 1493 | 2 | < 0.1% |
| 1492 | 2 | < 0.1% |
| 1491 | 1 | < 0.1% |
| 1490 | 2 | < 0.1% |
Interest_Rate
Real number (ℝ)
| Distinct | 2394 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 71.234907 |
| Minimum | 1 |
|---|---|
| Maximum | 5799 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 13 |
| Q3 | 20 |
| 95-th percentile | 32 |
| Maximum | 5799 |
| Range | 5798 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 461.53719 |
|---|---|
| Coefficient of variation (CV) | 6.4790875 |
| Kurtosis | 87.492459 |
| Mean | 71.234907 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 9.1228759 |
| Sum | 10685236 |
| Variance | 213016.58 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 7515 | 5.0% |
| 5 | 7479 | 5.0% |
| 6 | 7089 | 4.7% |
| 12 | 6828 | 4.6% |
| 10 | 6799 | 4.5% |
| 9 | 6747 | 4.5% |
| 7 | 6744 | 4.5% |
| 11 | 6626 | 4.4% |
| 18 | 6154 | 4.1% |
| 15 | 5984 | 4.0% |
| Other values (2384) | 82035 |
| Value | Count | Frequency (%) |
| 1 | 4027 | |
| 2 | 3710 | |
| 3 | 4153 | |
| 4 | 3876 | |
| 5 | 7479 | |
| 6 | 7089 | |
| 7 | 6744 | |
| 8 | 7515 | |
| 9 | 6747 | |
| 10 | 6799 |
| Value | Count | Frequency (%) |
| 5799 | 1 | |
| 5797 | 1 | |
| 5792 | 1 | |
| 5789 | 1 | |
| 5788 | 1 | |
| 5776 | 1 | |
| 5775 | 1 | |
| 5774 | 1 | |
| 5773 | 2 | |
| 5771 | 1 |
Num_of_Loan
Categorical
HIGH CARDINALITY  IMBALANCE 
| Distinct | 623 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 3 | |
|---|---|
| 2 | |
| 4 | |
| 0 | |
| 1 | |
| Other values (618) |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.1762333 |
| Min length | 1 |
Characters and Unicode
| Total characters | 176435 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 489 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 4 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 21500 | |
| 2 | 21423 | |
| 4 | 20998 | |
| 0 | 15543 | |
| 1 | 15112 | |
| 6 | 11112 | |
| 7 | 10413 | |
| 5 | 10302 | |
| -100 | 5850 | 3.9% |
| 9 | 5288 | 3.5% |
| Other values (613) | 12459 |
Length
| Value | Count | Frequency (%) |
| 3 | 22618 | |
| 2 | 22547 | |
| 4 | 22111 | |
| 0 | 16376 | |
| 1 | 15901 | |
| 6 | 11705 | |
| 7 | 11024 | |
| 5 | 10814 | |
| 100 | 5851 | 3.9% |
| 9 | 5539 | 3.7% |
| Other values (589) | 5514 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 28253 | |
| 3 | 22866 | |
| 2 | 22796 | |
| 4 | 22375 | |
| 1 | 22250 | |
| 6 | 11891 | |
| 7 | 11198 | 6.3% |
| 5 | 11023 | 6.2% |
| _ | 7221 | 4.1% |
| - | 5850 | 3.3% |
| Other values (2) | 10712 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 163364 | |
| Connector Punctuation | 7221 | 4.1% |
| Dash Punctuation | 5850 | 3.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 28253 | |
| 3 | 22866 | |
| 2 | 22796 | |
| 4 | 22375 | |
| 1 | 22250 | |
| 6 | 11891 | |
| 7 | 11198 | 6.9% |
| 5 | 11023 | 6.7% |
| 9 | 5747 | 3.5% |
| 8 | 4965 | 3.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7221 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5850 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 176435 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 28253 | |
| 3 | 22866 | |
| 2 | 22796 | |
| 4 | 22375 | |
| 1 | 22250 | |
| 6 | 11891 | |
| 7 | 11198 | 6.3% |
| 5 | 11023 | 6.2% |
| _ | 7221 | 4.1% |
| - | 5850 | 3.3% |
| Other values (2) | 10712 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 176435 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 28253 | |
| 3 | 22866 | |
| 2 | 22796 | |
| 4 | 22375 | |
| 1 | 22250 | |
| 6 | 11891 | |
| 7 | 11198 | 6.3% |
| 5 | 11023 | 6.2% |
| _ | 7221 | 4.1% |
| - | 5850 | 3.3% |
| Other values (2) | 10712 | 6.1% |
Type_of_Loan
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 6260 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 17112 |
| Missing (%) | 11.4% |
| Memory size | 1.1 MiB |
| Not Specified | 2112 |
|---|---|
| Credit-Builder Loan | 1920 |
| Personal Loan | 1908 |
| Debt Consolidation Loan | 1896 |
| Student Loan | 1860 |
| Other values (6255) |
Length
| Max length | 182 |
|---|---|
| Median length | 142 |
| Mean length | 66.683583 |
| Min length | 9 |
Characters and Unicode
| Total characters | 8861448 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
|---|---|
| 2nd row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
| 3rd row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
| 4th row | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan |
| 5th row | Credit-Builder Loan |
Common Values
| Value | Count | Frequency (%) |
| Not Specified | 2112 | 1.4% |
| Credit-Builder Loan | 1920 | 1.3% |
| Personal Loan | 1908 | 1.3% |
| Debt Consolidation Loan | 1896 | 1.3% |
| Student Loan | 1860 | 1.2% |
| Payday Loan | 1800 | 1.2% |
| Mortgage Loan | 1764 | 1.2% |
| Auto Loan | 1728 | 1.2% |
| Home Equity Loan | 1704 | 1.1% |
| Personal Loan, and Student Loan | 480 | 0.3% |
| Other values (6250) | 115716 | |
| (Missing) | 17112 | 11.4% |
Length
| Value | Count | Frequency (%) |
| loan | 470508 | |
| and | 116196 | 9.0% |
| payday | 60852 | 4.7% |
| credit-builder | 60660 | 4.7% |
| not | 59424 | 4.6% |
| specified | 59424 | 4.6% |
| home | 58656 | 4.5% |
| equity | 58656 | 4.5% |
| student | 58452 | 4.5% |
| mortgage | 58404 | 4.5% |
| Other values (4) | 231648 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1159992 | ||
| o | 936804 | |
| a | 883308 | 10.0% |
| n | 819816 | 9.3% |
| e | 532176 | 6.0% |
| t | 527364 | 6.0% |
| d | 474408 | 5.4% |
| L | 470508 | 5.3% |
| i | 415152 | 4.7% |
| , | 397044 | 4.5% |
| Other values (23) | 2244876 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6006408 | |
| Uppercase Letter | 1237344 | 14.0% |
| Space Separator | 1159992 | 13.1% |
| Other Punctuation | 397044 | 4.5% |
| Dash Punctuation | 60660 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 936804 | |
| a | 883308 | |
| n | 819816 | |
| e | 532176 | |
| t | 527364 | |
| d | 474408 | |
| i | 415152 | |
| r | 238056 | 4.0% |
| u | 234756 | 3.9% |
| y | 180360 | 3.0% |
| Other values (9) | 764208 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 470508 | |
| P | 119184 | 9.6% |
| C | 118824 | 9.6% |
| S | 117876 | 9.5% |
| B | 60660 | 4.9% |
| N | 59424 | 4.8% |
| H | 58656 | 4.7% |
| E | 58656 | 4.7% |
| M | 58404 | 4.7% |
| D | 58164 | 4.7% |
Space Separator
| Value | Count | Frequency (%) |
| 1159992 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 397044 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 60660 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7243752 | |
| Common | 1617696 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 936804 | |
| a | 883308 | |
| n | 819816 | |
| e | 532176 | 7.3% |
| t | 527364 | 7.3% |
| d | 474408 | 6.5% |
| L | 470508 | 6.5% |
| i | 415152 | 5.7% |
| r | 238056 | 3.3% |
| u | 234756 | 3.2% |
| Other values (20) | 1711404 |
Common
| Value | Count | Frequency (%) |
| 1159992 | ||
| , | 397044 | 24.5% |
| - | 60660 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8861448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1159992 | ||
| o | 936804 | |
| a | 883308 | 10.0% |
| n | 819816 | 9.3% |
| e | 532176 | 6.0% |
| t | 527364 | 6.0% |
| d | 474408 | 5.4% |
| L | 470508 | 5.3% |
| i | 415152 | 4.7% |
| , | 397044 | 4.5% |
| Other values (23) | 2244876 |
Delay_from_due_date
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 73 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.0634 |
| Minimum | -5 |
|---|---|
| Maximum | 67 |
| Zeros | 1821 |
| Zeros (%) | 1.2% |
| Negative | 889 |
| Negative (%) | 0.6% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | -5 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 10 |
| median | 18 |
| Q3 | 28 |
| 95-th percentile | 54 |
| Maximum | 67 |
| Range | 72 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 14.860154 |
|---|---|
| Coefficient of variation (CV) | 0.70549647 |
| Kurtosis | 0.3469547 |
| Mean | 21.0634 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.96589567 |
| Sum | 3159510 |
| Variance | 220.82419 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 15 | 5355 | 3.6% |
| 13 | 5185 | 3.5% |
| 8 | 5004 | 3.3% |
| 14 | 4949 | 3.3% |
| 10 | 4926 | 3.3% |
| 9 | 4889 | 3.3% |
| 7 | 4821 | 3.2% |
| 12 | 4766 | 3.2% |
| 11 | 4755 | 3.2% |
| 6 | 4721 | 3.1% |
| Other values (63) | 100629 |
| Value | Count | Frequency (%) |
| -5 | 51 | < 0.1% |
| -4 | 111 | 0.1% |
| -3 | 177 | 0.1% |
| -2 | 239 | 0.2% |
| -1 | 311 | 0.2% |
| 0 | 1821 | |
| 1 | 1994 | |
| 2 | 2011 | |
| 3 | 2534 | |
| 4 | 2547 |
| Value | Count | Frequency (%) |
| 67 | 29 | < 0.1% |
| 66 | 44 | < 0.1% |
| 65 | 86 | 0.1% |
| 64 | 97 | 0.1% |
| 63 | 90 | 0.1% |
| 62 | 824 | |
| 61 | 785 | |
| 60 | 792 | |
| 59 | 757 | |
| 58 | 835 |
Num_of_Delayed_Payment
Categorical
HIGH CARDINALITY  IMBALANCE  MISSING 
| Distinct | 1058 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 10500 |
| Missing (%) | 7.0% |
| Memory size | 1.1 MiB |
| 19 | 7949 |
|---|---|
| 17 | 7806 |
| 16 | 7721 |
| 15 | 7671 |
| 10 | 7670 |
| Other values (1053) |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 1.7707097 |
| Min length | 1 |
Characters and Unicode
| Total characters | 247014 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 867 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 7 |
|---|---|
| 2nd row | 9 |
| 3rd row | 4 |
| 4th row | 5 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 19 | 7949 | 5.3% |
| 17 | 7806 | 5.2% |
| 16 | 7721 | 5.1% |
| 15 | 7671 | 5.1% |
| 10 | 7670 | 5.1% |
| 18 | 7653 | 5.1% |
| 12 | 7388 | 4.9% |
| 20 | 7357 | 4.9% |
| 9 | 7199 | 4.8% |
| 11 | 7107 | 4.7% |
| Other values (1048) | 63979 | |
| (Missing) | 10500 | 7.0% |
Length
| Value | Count | Frequency (%) |
| 19 | 8188 | 5.9% |
| 17 | 8048 | 5.8% |
| 16 | 7949 | 5.7% |
| 15 | 7911 | 5.7% |
| 10 | 7900 | 5.7% |
| 18 | 7847 | 5.6% |
| 12 | 7622 | 5.5% |
| 20 | 7607 | 5.5% |
| 9 | 7421 | 5.3% |
| 11 | 7314 | 5.2% |
| Other values (1001) | 61693 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 89916 | |
| 2 | 38954 | |
| 0 | 18253 | 7.4% |
| 9 | 15941 | 6.5% |
| 8 | 15700 | 6.4% |
| 5 | 13902 | 5.6% |
| 3 | 12780 | 5.2% |
| 7 | 12291 | 5.0% |
| 6 | 12184 | 4.9% |
| 4 | 11991 | 4.9% |
| Other values (2) | 5102 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 241912 | |
| Connector Punctuation | 4171 | 1.7% |
| Dash Punctuation | 931 | 0.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 89916 | |
| 2 | 38954 | |
| 0 | 18253 | 7.5% |
| 9 | 15941 | 6.6% |
| 8 | 15700 | 6.5% |
| 5 | 13902 | 5.7% |
| 3 | 12780 | 5.3% |
| 7 | 12291 | 5.1% |
| 6 | 12184 | 5.0% |
| 4 | 11991 | 5.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4171 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 931 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 247014 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 89916 | |
| 2 | 38954 | |
| 0 | 18253 | 7.4% |
| 9 | 15941 | 6.5% |
| 8 | 15700 | 6.4% |
| 5 | 13902 | 5.6% |
| 3 | 12780 | 5.2% |
| 7 | 12291 | 5.0% |
| 6 | 12184 | 4.9% |
| 4 | 11991 | 4.9% |
| Other values (2) | 5102 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 247014 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 89916 | |
| 2 | 38954 | |
| 0 | 18253 | 7.4% |
| 9 | 15941 | 6.5% |
| 8 | 15700 | 6.4% |
| 5 | 13902 | 5.6% |
| 3 | 12780 | 5.2% |
| 7 | 12291 | 5.0% |
| 6 | 12184 | 4.9% |
| 4 | 11991 | 4.9% |
| Other values (2) | 5102 | 2.1% |
Changed_Credit_Limit
Categorical
| Distinct | 4605 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| _ | 3150 |
|---|---|
| 11.5 | 197 |
| 8.22 | 189 |
| 11.32 | 189 |
| 7.35 | 181 |
| Other values (4600) |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 4.70556 |
| Min length | 1 |
Characters and Unicode
| Total characters | 705834 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 480 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 11.27 |
|---|---|
| 2nd row | 13.27 |
| 3rd row | 12.27 |
| 4th row | 11.27 |
| 5th row | 5.42 |
Common Values
| Value | Count | Frequency (%) |
| _ | 3150 | 2.1% |
| 11.5 | 197 | 0.1% |
| 8.22 | 189 | 0.1% |
| 11.32 | 189 | 0.1% |
| 7.35 | 181 | 0.1% |
| 10.06 | 178 | 0.1% |
| 8.23 | 169 | 0.1% |
| 7.69 | 166 | 0.1% |
| 7.01 | 165 | 0.1% |
| 11.49 | 164 | 0.1% |
| Other values (4595) | 145252 |
Length
| Value | Count | Frequency (%) |
| 3150 | 2.1% | |
| 11.5 | 197 | 0.1% |
| 8.22 | 189 | 0.1% |
| 11.32 | 189 | 0.1% |
| 7.35 | 181 | 0.1% |
| 10.06 | 178 | 0.1% |
| 8.23 | 169 | 0.1% |
| 3.93 | 168 | 0.1% |
| 7.69 | 166 | 0.1% |
| 7.01 | 165 | 0.1% |
| Other values (3879) | 145248 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 146850 | |
| 1 | 103312 | |
| 9 | 69985 | |
| 0 | 59693 | |
| 2 | 54609 | 7.7% |
| 7 | 45960 | 6.5% |
| 8 | 45785 | 6.5% |
| 5 | 44455 | 6.3% |
| 6 | 43906 | 6.2% |
| 3 | 43013 | 6.1% |
| Other values (3) | 48266 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 553413 | |
| Other Punctuation | 146850 | 20.8% |
| Connector Punctuation | 3150 | 0.4% |
| Dash Punctuation | 2421 | 0.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 103312 | |
| 9 | 69985 | |
| 0 | 59693 | |
| 2 | 54609 | |
| 7 | 45960 | |
| 8 | 45785 | |
| 5 | 44455 | |
| 6 | 43906 | |
| 3 | 43013 | |
| 4 | 42695 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 146850 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3150 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2421 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 705834 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 146850 | |
| 1 | 103312 | |
| 9 | 69985 | |
| 0 | 59693 | |
| 2 | 54609 | 7.7% |
| 7 | 45960 | 6.5% |
| 8 | 45785 | 6.5% |
| 5 | 44455 | 6.3% |
| 6 | 43906 | 6.2% |
| 3 | 43013 | 6.1% |
| Other values (3) | 48266 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 705834 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 146850 | |
| 1 | 103312 | |
| 9 | 69985 | |
| 0 | 59693 | |
| 2 | 54609 | 7.7% |
| 7 | 45960 | 6.5% |
| 8 | 45785 | 6.5% |
| 5 | 44455 | 6.3% |
| 6 | 43906 | 6.2% |
| 3 | 43013 | 6.1% |
| Other values (3) | 48266 | 6.8% |
Num_Credit_Inquiries
Real number (ℝ)
HIGH CORRELATION  MISSING  ZEROS 
| Distinct | 1607 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 3000 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.529014 |
| Minimum | 0 |
|---|---|
| Maximum | 2597 |
| Zeros | 8074 |
| Zeros (%) | 5.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 14 |
| Maximum | 2597 |
| Range | 2597 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 194.45606 |
|---|---|
| Coefficient of variation (CV) | 6.8160807 |
| Kurtosis | 99.144085 |
| Mean | 28.529014 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 9.7183174 |
| Sum | 4193765 |
| Variance | 37813.158 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 15673 | |
| 6 | 12486 | 8.3% |
| 3 | 12356 | 8.2% |
| 7 | 12353 | 8.2% |
| 8 | 11788 | 7.9% |
| 2 | 10482 | 7.0% |
| 5 | 10402 | 6.9% |
| 1 | 9335 | 6.2% |
| 9 | 8806 | 5.9% |
| 0 | 8074 | 5.4% |
| Other values (1597) | 35245 |
| Value | Count | Frequency (%) |
| 0 | 8074 | |
| 1 | 9335 | |
| 2 | 10482 | |
| 3 | 12356 | |
| 4 | 15673 | |
| 5 | 10402 | |
| 6 | 12486 | |
| 7 | 12353 | |
| 8 | 11788 | |
| 9 | 8806 |
| Value | Count | Frequency (%) |
| 2597 | 1 | < 0.1% |
| 2594 | 1 | < 0.1% |
| 2593 | 1 | < 0.1% |
| 2592 | 3 | |
| 2589 | 2 | |
| 2588 | 2 | |
| 2587 | 1 | < 0.1% |
| 2586 | 2 | |
| 2583 | 2 | |
| 2580 | 1 | < 0.1% |
Credit_Mix
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Standard | |
|---|---|
| Good | |
| _ | |
| Bad |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.67258 |
| Min length | 1 |
Characters and Unicode
| Total characters | 700887 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Good |
|---|---|
| 2nd row | Good |
| 3rd row | Good |
| 4th row | Good |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Standard | 54858 | |
| Good | 36597 | |
| _ | 30000 | |
| Bad | 28545 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| standard | 54858 | |
| good | 36597 | |
| 30000 | ||
| bad | 28545 |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 174858 | |
| a | 138261 | |
| o | 73194 | |
| S | 54858 | 7.8% |
| t | 54858 | 7.8% |
| n | 54858 | 7.8% |
| r | 54858 | 7.8% |
| G | 36597 | 5.2% |
| _ | 30000 | 4.3% |
| B | 28545 | 4.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 550887 | |
| Uppercase Letter | 120000 | 17.1% |
| Connector Punctuation | 30000 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 174858 | |
| a | 138261 | |
| o | 73194 | |
| t | 54858 | 10.0% |
| n | 54858 | 10.0% |
| r | 54858 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 54858 | |
| G | 36597 | |
| B | 28545 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 30000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 670887 | |
| Common | 30000 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 174858 | |
| a | 138261 | |
| o | 73194 | |
| S | 54858 | 8.2% |
| t | 54858 | 8.2% |
| n | 54858 | 8.2% |
| r | 54858 | 8.2% |
| G | 36597 | 5.5% |
| B | 28545 | 4.3% |
Common
| Value | Count | Frequency (%) |
| _ | 30000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 700887 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 174858 | |
| a | 138261 | |
| o | 73194 | |
| S | 54858 | 7.8% |
| t | 54858 | 7.8% |
| n | 54858 | 7.8% |
| r | 54858 | 7.8% |
| G | 36597 | 5.2% |
| _ | 30000 | 4.3% |
| B | 28545 | 4.1% |
Outstanding_Debt
Categorical
| Distinct | 13622 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| 1360.45 | 36 |
|---|---|
| 1151.7 | 35 |
| 1109.03 | 35 |
| 460.46 | 35 |
| 935.74 | 24 |
| Other values (13617) |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.4332 |
| Min length | 3 |
Characters and Unicode
| Total characters | 964980 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1340 ? |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | 809.98 |
|---|---|
| 2nd row | 809.98 |
| 3rd row | 809.98 |
| 4th row | 809.98 |
| 5th row | 605.03 |
Common Values
| Value | Count | Frequency (%) |
| 1360.45 | 36 | < 0.1% |
| 1151.7 | 35 | < 0.1% |
| 1109.03 | 35 | < 0.1% |
| 460.46 | 35 | < 0.1% |
| 935.74 | 24 | < 0.1% |
| 1292.14 | 24 | < 0.1% |
| 1454.68 | 24 | < 0.1% |
| 1024.56 | 24 | < 0.1% |
| 438.75 | 24 | < 0.1% |
| 1072.42 | 24 | < 0.1% |
| Other values (13612) | 149715 |
Length
| Value | Count | Frequency (%) |
| 1360.45 | 36 | < 0.1% |
| 1109.03 | 36 | < 0.1% |
| 460.46 | 36 | < 0.1% |
| 1151.7 | 36 | < 0.1% |
| 1464.16 | 24 | < 0.1% |
| 1194.38 | 24 | < 0.1% |
| 10.29 | 24 | < 0.1% |
| 585.77 | 24 | < 0.1% |
| 462.11 | 24 | < 0.1% |
| 969.19 | 24 | < 0.1% |
| Other values (12193) | 149712 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 150000 | |
| 1 | 125352 | |
| 2 | 95904 | |
| 3 | 88260 | |
| 4 | 87528 | |
| 5 | 74196 | |
| 6 | 73368 | |
| 8 | 72012 | |
| 7 | 71496 | |
| 9 | 70716 | |
| Other values (2) | 56148 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 813480 | |
| Other Punctuation | 150000 | 15.5% |
| Connector Punctuation | 1500 | 0.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 125352 | |
| 2 | 95904 | |
| 3 | 88260 | |
| 4 | 87528 | |
| 5 | 74196 | |
| 6 | 73368 | |
| 8 | 72012 | |
| 7 | 71496 | |
| 9 | 70716 | |
| 0 | 54648 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 150000 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1500 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 964980 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 150000 | |
| 1 | 125352 | |
| 2 | 95904 | |
| 3 | 88260 | |
| 4 | 87528 | |
| 5 | 74196 | |
| 6 | 73368 | |
| 8 | 72012 | |
| 7 | 71496 | |
| 9 | 70716 | |
| Other values (2) | 56148 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 964980 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 150000 | |
| 1 | 125352 | |
| 2 | 95904 | |
| 3 | 88260 | |
| 4 | 87528 | |
| 5 | 74196 | |
| 6 | 73368 | |
| 8 | 72012 | |
| 7 | 71496 | |
| 9 | 70716 | |
| Other values (2) | 56148 | 5.8% |
Credit_Utilization_Ratio
Real number (ℝ)
| Distinct | 150000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 32.283309 |
| Minimum | 20 |
|---|---|
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 20 |
|---|---|
| 5-th percentile | 24.247778 |
| Q1 | 28.054731 |
| median | 32.297058 |
| Q3 | 36.487954 |
| 95-th percentile | 40.228918 |
| Maximum | 50 |
| Range | 30 |
| Interquartile range (IQR) | 8.4332227 |
Descriptive statistics
| Standard deviation | 5.1133154 |
|---|---|
| Coefficient of variation (CV) | 0.15838883 |
| Kurtosis | -0.94582088 |
| Mean | 32.283309 |
| Median Absolute Deviation (MAD) | 4.2156115 |
| Skewness | 0.031599852 |
| Sum | 4842496.3 |
| Variance | 26.145994 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 35.03040186 | 1 | < 0.1% |
| 26.91679533 | 1 | < 0.1% |
| 27.73605126 | 1 | < 0.1% |
| 40.98413994 | 1 | < 0.1% |
| 26.12971783 | 1 | < 0.1% |
| 24.78390632 | 1 | < 0.1% |
| 26.65025816 | 1 | < 0.1% |
| 23.86424414 | 1 | < 0.1% |
| 29.63812969 | 1 | < 0.1% |
| 31.87539864 | 1 | < 0.1% |
| Other values (149990) | 149990 |
| Value | Count | Frequency (%) |
| 20 | 1 | |
| 20.10076996 | 1 | |
| 20.1729419 | 1 | |
| 20.24413035 | 1 | |
| 20.25707336 | 1 | |
| 20.50965206 | 1 | |
| 20.62001732 | 1 | |
| 20.71974515 | 1 | |
| 20.73922549 | 1 | |
| 20.80058685 | 1 |
| Value | Count | Frequency (%) |
| 50 | 1 | |
| 49.56451935 | 1 | |
| 49.5223243 | 1 | |
| 49.25498298 | 1 | |
| 49.06427745 | 1 | |
| 48.54066309 | 1 | |
| 48.48985173 | 1 | |
| 48.33729091 | 1 | |
| 48.24700252 | 1 | |
| 48.22871401 | 1 |
Credit_History_Age
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 408 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 13500 |
| Missing (%) | 9.0% |
| Memory size | 1.1 MiB |
| 17 Years and 11 Months | 628 |
|---|---|
| 18 Years and 4 Months | 621 |
| 18 Years and 3 Months | 617 |
| 19 Years and 9 Months | 615 |
| 18 Years and 2 Months | 615 |
| Other values (403) |
Length
| Max length | 22 |
|---|---|
| Median length | 21 |
| Mean length | 20.982945 |
| Min length | 20 |
Characters and Unicode
| Total characters | 2864172 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 22 Years and 9 Months |
|---|---|
| 2nd row | 22 Years and 10 Months |
| 3rd row | 23 Years and 0 Months |
| 4th row | 27 Years and 3 Months |
| 5th row | 27 Years and 4 Months |
Common Values
| Value | Count | Frequency (%) |
| 17 Years and 11 Months | 628 | 0.4% |
| 18 Years and 4 Months | 621 | 0.4% |
| 18 Years and 3 Months | 617 | 0.4% |
| 19 Years and 9 Months | 615 | 0.4% |
| 18 Years and 2 Months | 615 | 0.4% |
| 16 Years and 2 Months | 612 | 0.4% |
| 18 Years and 1 Months | 612 | 0.4% |
| 16 Years and 1 Months | 610 | 0.4% |
| 18 Years and 0 Months | 609 | 0.4% |
| 19 Years and 5 Months | 608 | 0.4% |
| Other values (398) | 130353 | |
| (Missing) | 13500 | 9.0% |
Length
| Value | Count | Frequency (%) |
| years | 136500 | |
| and | 136500 | |
| months | 136500 | |
| 6 | 15883 | 2.3% |
| 8 | 15600 | 2.3% |
| 9 | 15544 | 2.3% |
| 10 | 15410 | 2.3% |
| 11 | 15400 | 2.3% |
| 7 | 15268 | 2.2% |
| 5 | 13893 | 2.0% |
| Other values (28) | 166002 |
Most occurring characters
| Value | Count | Frequency (%) |
| 546000 | ||
| a | 273000 | |
| s | 273000 | |
| n | 273000 | |
| o | 136500 | 4.8% |
| t | 136500 | 4.8% |
| Y | 136500 | 4.8% |
| e | 136500 | 4.8% |
| r | 136500 | 4.8% |
| d | 136500 | 4.8% |
| Other values (12) | 680172 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1638000 | |
| Space Separator | 546000 | 19.1% |
| Decimal Number | 407172 | 14.2% |
| Uppercase Letter | 273000 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 113106 | |
| 2 | 67564 | |
| 3 | 39070 | 9.6% |
| 0 | 37297 | 9.2% |
| 6 | 26941 | 6.6% |
| 9 | 26898 | 6.6% |
| 8 | 26897 | 6.6% |
| 7 | 26132 | 6.4% |
| 5 | 22715 | 5.6% |
| 4 | 20552 | 5.0% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 273000 | |
| s | 273000 | |
| n | 273000 | |
| o | 136500 | |
| t | 136500 | |
| e | 136500 | |
| r | 136500 | |
| d | 136500 | |
| h | 136500 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 136500 | |
| M | 136500 |
Space Separator
| Value | Count | Frequency (%) |
| 546000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1911000 | |
| Common | 953172 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 546000 | ||
| 1 | 113106 | 11.9% |
| 2 | 67564 | 7.1% |
| 3 | 39070 | 4.1% |
| 0 | 37297 | 3.9% |
| 6 | 26941 | 2.8% |
| 9 | 26898 | 2.8% |
| 8 | 26897 | 2.8% |
| 7 | 26132 | 2.7% |
| 5 | 22715 | 2.4% |
Latin
| Value | Count | Frequency (%) |
| a | 273000 | |
| s | 273000 | |
| n | 273000 | |
| o | 136500 | |
| t | 136500 | |
| Y | 136500 | |
| e | 136500 | |
| r | 136500 | |
| d | 136500 | |
| M | 136500 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2864172 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 546000 | ||
| a | 273000 | |
| s | 273000 | |
| n | 273000 | |
| o | 136500 | 4.8% |
| t | 136500 | 4.8% |
| Y | 136500 | 4.8% |
| e | 136500 | 4.8% |
| r | 136500 | 4.8% |
| d | 136500 | 4.8% |
| Other values (12) | 680172 |
Payment_of_Min_Amount
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Yes | |
|---|---|
| No | |
| NM |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.5232267 |
| Min length | 2 |
Characters and Unicode
| Total characters | 378484 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| Yes | 78484 | |
| No | 53516 | |
| NM | 18000 | 12.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| yes | 78484 | |
| no | 53516 | |
| nm | 18000 | 12.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 78484 | |
| e | 78484 | |
| s | 78484 | |
| N | 71516 | |
| o | 53516 | |
| M | 18000 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 210484 | |
| Uppercase Letter | 168000 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 78484 | |
| N | 71516 | |
| M | 18000 | 10.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 78484 | |
| s | 78484 | |
| o | 53516 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 378484 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 78484 | |
| e | 78484 | |
| s | 78484 | |
| N | 71516 | |
| o | 53516 | |
| M | 18000 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 378484 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 78484 | |
| e | 78484 | |
| s | 78484 | |
| N | 71516 | |
| o | 53516 | |
| M | 18000 | 4.8% |
Total_EMI_per_month
Real number (ℝ)
| Distinct | 16960 |
|---|---|
| Distinct (%) | 11.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1432.5136 |
| Minimum | 0 |
|---|---|
| Maximum | 82398 |
| Zeros | 15615 |
| Zeros (%) | 10.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 30.947775 |
| median | 71.280006 |
| Q3 | 166.27956 |
| 95-th percentile | 499.73202 |
| Maximum | 82398 |
| Range | 82398 |
| Interquartile range (IQR) | 135.33178 |
Descriptive statistics
| Standard deviation | 8403.76 |
|---|---|
| Coefficient of variation (CV) | 5.8664435 |
| Kurtosis | 51.398183 |
| Mean | 1432.5136 |
| Median Absolute Deviation (MAD) | 51.776274 |
| Skewness | 7.0497762 |
| Sum | 2.1487704 × 108 |
| Variance | 70623182 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 15615 | 10.4% |
| 49.57494921 | 12 | < 0.1% |
| 27.97205372 | 12 | < 0.1% |
| 48.84009702 | 12 | < 0.1% |
| 24.25711798 | 12 | < 0.1% |
| 111.6372765 | 12 | < 0.1% |
| 212.4293521 | 12 | < 0.1% |
| 18.69416213 | 12 | < 0.1% |
| 126.5799472 | 12 | < 0.1% |
| 203.6057951 | 12 | < 0.1% |
| Other values (16950) | 134277 |
| Value | Count | Frequency (%) |
| 0 | 15615 | |
| 4.462837467 | 12 | < 0.1% |
| 4.713183572 | 12 | < 0.1% |
| 4.865689677 | 12 | < 0.1% |
| 4.916138542 | 12 | < 0.1% |
| 5.138484696 | 12 | < 0.1% |
| 5.218466359 | 12 | < 0.1% |
| 5.24927327 | 11 | < 0.1% |
| 5.262291048 | 12 | < 0.1% |
| 5.351086151 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 82398 | 1 | |
| 82347 | 1 | |
| 82331 | 1 | |
| 82316 | 1 | |
| 82256 | 1 | |
| 82248 | 1 | |
| 82236 | 1 | |
| 82235 | 1 | |
| 82225 | 1 | |
| 82204 | 1 |
Amount_invested_monthly
Categorical
HIGH CARDINALITY  MISSING 
| Distinct | 136497 |
|---|---|
| Distinct (%) | 95.3% |
| Missing | 6750 |
| Missing (%) | 4.5% |
| Memory size | 1.1 MiB |
| __10000__ | 6480 |
|---|---|
| 0.0 | 275 |
| 79.77734815487014 | 1 |
| 70.78372395611446 | 1 |
| 36.319514426769054 | 1 |
| Other values (136492) |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 16.962729 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2429911 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 136495 ? |
|---|---|
| Unique (%) | 95.3% |
Sample
| 1st row | 236.64268203272135 |
|---|---|
| 2nd row | 21.465380264657146 |
| 3rd row | 148.23393788500925 |
| 4th row | 39.08251089460281 |
| 5th row | 39.684018417945296 |
Common Values
| Value | Count | Frequency (%) |
| __10000__ | 6480 | 4.3% |
| 0.0 | 275 | 0.2% |
| 79.77734815487014 | 1 | < 0.1% |
| 70.78372395611446 | 1 | < 0.1% |
| 36.319514426769054 | 1 | < 0.1% |
| 152.64729262606082 | 1 | < 0.1% |
| 25.795644267454087 | 1 | < 0.1% |
| 238.46444826179817 | 1 | < 0.1% |
| 236.64268203272135 | 1 | < 0.1% |
| 199.24670014227908 | 1 | < 0.1% |
| Other values (136487) | 136487 | |
| (Missing) | 6750 | 4.5% |
Length
| Value | Count | Frequency (%) |
| 10000 | 6480 | 4.5% |
| 0.0 | 275 | 0.2% |
| 177.95183568608738 | 1 | < 0.1% |
| 181.0072171318061 | 1 | < 0.1% |
| 251.62736875017606 | 1 | < 0.1% |
| 72.68014533363515 | 1 | < 0.1% |
| 153.53448761392985 | 1 | < 0.1% |
| 397.50365354404653 | 1 | < 0.1% |
| 453.6151305781054 | 1 | < 0.1% |
| 841.2322359154716 | 1 | < 0.1% |
| Other values (136487) | 136487 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 259899 | |
| 2 | 234298 | |
| 4 | 227447 | |
| 3 | 226553 | |
| 0 | 226115 | |
| 5 | 224410 | |
| 6 | 223483 | |
| 8 | 217962 | |
| 7 | 217847 | |
| 9 | 209207 | |
| Other values (2) | 162690 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2267221 | |
| Other Punctuation | 136770 | 5.6% |
| Connector Punctuation | 25920 | 1.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 259899 | |
| 2 | 234298 | |
| 4 | 227447 | |
| 3 | 226553 | |
| 0 | 226115 | |
| 5 | 224410 | |
| 6 | 223483 | |
| 8 | 217962 | |
| 7 | 217847 | |
| 9 | 209207 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 136770 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 25920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2429911 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 259899 | |
| 2 | 234298 | |
| 4 | 227447 | |
| 3 | 226553 | |
| 0 | 226115 | |
| 5 | 224410 | |
| 6 | 223483 | |
| 8 | 217962 | |
| 7 | 217847 | |
| 9 | 209207 | |
| Other values (2) | 162690 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2429911 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 259899 | |
| 2 | 234298 | |
| 4 | 227447 | |
| 3 | 226553 | |
| 0 | 226115 | |
| 5 | 224410 | |
| 6 | 223483 | |
| 8 | 217962 | |
| 7 | 217847 | |
| 9 | 209207 | |
| Other values (2) | 162690 |
Payment_Behaviour
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.1 MiB |
| Low_spent_Small_value_payments | |
|---|---|
| High_spent_Medium_value_payments | |
| Low_spent_Medium_value_payments | |
| High_spent_Large_value_payments | |
| High_spent_Small_value_payments | |
| Other values (2) |
Length
| Max length | 32 |
|---|---|
| Median length | 31 |
| Mean length | 28.917187 |
| Min length | 6 |
Characters and Unicode
| Total characters | 4337578 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Low_spent_Small_value_payments |
|---|---|
| 2nd row | High_spent_Medium_value_payments |
| 3rd row | Low_spent_Medium_value_payments |
| 4th row | High_spent_Medium_value_payments |
| 5th row | High_spent_Large_value_payments |
Common Values
| Value | Count | Frequency (%) |
| Low_spent_Small_value_payments | 38207 | |
| High_spent_Medium_value_payments | 26462 | |
| Low_spent_Medium_value_payments | 20698 | |
| High_spent_Large_value_payments | 20565 | |
| High_spent_Small_value_payments | 16991 | |
| Low_spent_Large_value_payments | 15677 | |
| !@9#%8 | 11400 | 7.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| low_spent_small_value_payments | 38207 | |
| high_spent_medium_value_payments | 26462 | |
| low_spent_medium_value_payments | 20698 | |
| high_spent_large_value_payments | 20565 | |
| high_spent_small_value_payments | 16991 | |
| low_spent_large_value_payments | 15677 | |
| 9#%8 | 11400 | 7.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 554400 | |
| e | 499202 | |
| a | 368640 | 8.5% |
| s | 277200 | 6.4% |
| p | 277200 | 6.4% |
| n | 277200 | 6.4% |
| t | 277200 | 6.4% |
| l | 248996 | 5.7% |
| m | 240958 | 5.6% |
| u | 185760 | 4.3% |
| Other values (19) | 1130822 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3437578 | |
| Connector Punctuation | 554400 | 12.8% |
| Uppercase Letter | 277200 | 6.4% |
| Other Punctuation | 45600 | 1.1% |
| Decimal Number | 22800 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 499202 | |
| a | 368640 | |
| s | 277200 | |
| p | 277200 | |
| n | 277200 | |
| t | 277200 | |
| l | 248996 | 7.2% |
| m | 240958 | 7.0% |
| u | 185760 | 5.4% |
| v | 138600 | 4.0% |
| Other values (8) | 646622 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 110824 | |
| H | 64018 | |
| S | 55198 | |
| M | 47160 |
Other Punctuation
| Value | Count | Frequency (%) |
| ! | 11400 | |
| @ | 11400 | |
| # | 11400 | |
| % | 11400 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 11400 | |
| 8 | 11400 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 554400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3714778 | |
| Common | 622800 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 499202 | |
| a | 368640 | 9.9% |
| s | 277200 | 7.5% |
| p | 277200 | 7.5% |
| n | 277200 | 7.5% |
| t | 277200 | 7.5% |
| l | 248996 | 6.7% |
| m | 240958 | 6.5% |
| u | 185760 | 5.0% |
| v | 138600 | 3.7% |
| Other values (12) | 923822 |
Common
| Value | Count | Frequency (%) |
| _ | 554400 | |
| ! | 11400 | 1.8% |
| @ | 11400 | 1.8% |
| 9 | 11400 | 1.8% |
| # | 11400 | 1.8% |
| % | 11400 | 1.8% |
| 8 | 11400 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4337578 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 554400 | |
| e | 499202 | |
| a | 368640 | 8.5% |
| s | 277200 | 6.4% |
| p | 277200 | 6.4% |
| n | 277200 | 6.4% |
| t | 277200 | 6.4% |
| l | 248996 | 5.7% |
| m | 240958 | 5.6% |
| u | 185760 | 4.3% |
| Other values (19) | 1130822 |
Monthly_Balance
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 1762 |
|---|---|
| Missing (%) | 1.2% |
| Memory size | 1.1 MiB |
Credit_Score
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 50000 |
| Missing (%) | 33.3% |
| Memory size | 1.1 MiB |
| Standard | |
|---|---|
| Poor | |
| Good |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 6.12696 |
| Min length | 4 |
Characters and Unicode
| Total characters | 612696 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Good |
|---|---|
| 2nd row | Good |
| 3rd row | Good |
| 4th row | Good |
| 5th row | Good |
Common Values
| Value | Count | Frequency (%) |
| Standard | 53174 | |
| Poor | 28998 | |
| Good | 17828 | 11.9% |
| (Missing) | 50000 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| standard | 53174 | |
| poor | 28998 | |
| good | 17828 | 17.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 124176 | |
| a | 106348 | |
| o | 93652 | |
| r | 82172 | |
| S | 53174 | |
| t | 53174 | |
| n | 53174 | |
| P | 28998 | 4.7% |
| G | 17828 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 512696 | |
| Uppercase Letter | 100000 | 16.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 124176 | |
| a | 106348 | |
| o | 93652 | |
| r | 82172 | |
| t | 53174 | |
| n | 53174 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 53174 | |
| P | 28998 | |
| G | 17828 | 17.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 612696 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 124176 | |
| a | 106348 | |
| o | 93652 | |
| r | 82172 | |
| S | 53174 | |
| t | 53174 | |
| n | 53174 | |
| P | 28998 | 4.7% |
| G | 17828 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 612696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| d | 124176 | |
| a | 106348 | |
| o | 93652 | |
| r | 82172 | |
| S | 53174 | |
| t | 53174 | |
| n | 53174 | |
| P | 28998 | 4.7% |
| G | 17828 | 2.9% |
| Monthly_Inhand_Salary | Num_Bank_Accounts | Num_Credit_Card | Interest_Rate | Delay_from_due_date | Num_Credit_Inquiries | Credit_Utilization_Ratio | Total_EMI_per_month | Month | Occupation | Credit_Mix | Payment_of_Min_Amount | Payment_Behaviour | Credit_Score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Monthly_Inhand_Salary | 1.000 | -0.262 | -0.193 | -0.283 | -0.241 | -0.261 | 0.135 | 0.451 | 0.000 | 0.027 | 0.213 | 0.230 | 0.168 | 0.181 |
| Num_Bank_Accounts | -0.262 | 1.000 | 0.400 | 0.555 | 0.557 | 0.482 | -0.064 | 0.106 | 0.004 | 0.000 | 0.000 | 0.004 | 0.000 | 0.004 |
| Num_Credit_Card | -0.193 | 0.400 | 1.000 | 0.426 | 0.422 | 0.390 | -0.047 | 0.100 | 0.002 | 0.004 | 0.000 | 0.000 | 0.004 | 0.000 |
| Interest_Rate | -0.283 | 0.555 | 0.426 | 1.000 | 0.549 | 0.569 | -0.065 | 0.138 | 0.004 | 0.004 | 0.006 | 0.004 | 0.000 | 0.002 |
| Delay_from_due_date | -0.241 | 0.557 | 0.422 | 0.549 | 1.000 | 0.490 | -0.061 | 0.132 | 0.000 | 0.026 | 0.424 | 0.357 | 0.038 | 0.339 |
| Num_Credit_Inquiries | -0.261 | 0.482 | 0.390 | 0.569 | 0.490 | 1.000 | -0.066 | 0.171 | 0.003 | 0.001 | 0.002 | 0.002 | 0.004 | 0.008 |
| Credit_Utilization_Ratio | 0.135 | -0.064 | -0.047 | -0.065 | -0.061 | -0.066 | 1.000 | 0.008 | 0.000 | 0.003 | 0.065 | 0.074 | 0.074 | 0.045 |
| Total_EMI_per_month | 0.451 | 0.106 | 0.100 | 0.138 | 0.132 | 0.171 | 0.008 | 1.000 | 0.004 | 0.000 | 0.000 | 0.005 | 0.001 | 0.005 |
| Month | 0.000 | 0.004 | 0.002 | 0.004 | 0.000 | 0.003 | 0.000 | 0.004 | 1.000 | 0.000 | 0.002 | 0.000 | 0.000 | 0.031 |
| Occupation | 0.027 | 0.000 | 0.004 | 0.004 | 0.026 | 0.001 | 0.003 | 0.000 | 0.000 | 1.000 | 0.023 | 0.017 | 0.003 | 0.027 |
| Credit_Mix | 0.213 | 0.000 | 0.000 | 0.006 | 0.424 | 0.002 | 0.065 | 0.000 | 0.002 | 0.023 | 1.000 | 0.487 | 0.062 | 0.402 |
| Payment_of_Min_Amount | 0.230 | 0.004 | 0.000 | 0.004 | 0.357 | 0.002 | 0.074 | 0.005 | 0.000 | 0.017 | 0.487 | 1.000 | 0.071 | 0.313 |
| Payment_Behaviour | 0.168 | 0.000 | 0.004 | 0.000 | 0.038 | 0.004 | 0.074 | 0.001 | 0.000 | 0.003 | 0.062 | 0.071 | 1.000 | 0.084 |
| Credit_Score | 0.181 | 0.004 | 0.000 | 0.002 | 0.339 | 0.008 | 0.045 | 0.005 | 0.031 | 0.027 | 0.402 | 0.313 | 0.084 | 1.000 |
| ID | Customer_ID | Month | Name | Age | SSN | Occupation | Annual_Income | Monthly_Inhand_Salary | Num_Bank_Accounts | Num_Credit_Card | Interest_Rate | Num_of_Loan | Type_of_Loan | Delay_from_due_date | Num_of_Delayed_Payment | Changed_Credit_Limit | Num_Credit_Inquiries | Credit_Mix | Outstanding_Debt | Credit_Utilization_Ratio | Credit_History_Age | Payment_of_Min_Amount | Total_EMI_per_month | Amount_invested_monthly | Payment_Behaviour | Monthly_Balance | Credit_Score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0x160a | CUS_0xd40 | September | Aaron Maashoh | 23 | 821-00-0265 | Scientist | 19114.12 | 1824.843333 | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | 3 | 7 | 11.27 | 2022.0 | Good | 809.98 | 35.030402 | 22 Years and 9 Months | No | 49.574949 | 236.64268203272135 | Low_spent_Small_value_payments | 186.26670208571772 | NaN |
| 1 | 0x160b | CUS_0xd40 | October | Aaron Maashoh | 24 | 821-00-0265 | Scientist | 19114.12 | 1824.843333 | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | 3 | 9 | 13.27 | 4.0 | Good | 809.98 | 33.053114 | 22 Years and 10 Months | No | 49.574949 | 21.465380264657146 | High_spent_Medium_value_payments | 361.44400385378196 | NaN |
| 2 | 0x160c | CUS_0xd40 | November | Aaron Maashoh | 24 | 821-00-0265 | Scientist | 19114.12 | 1824.843333 | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | -1 | 4 | 12.27 | 4.0 | Good | 809.98 | 33.811894 | NaN | No | 49.574949 | 148.23393788500925 | Low_spent_Medium_value_payments | 264.67544623342997 | NaN |
| 3 | 0x160d | CUS_0xd40 | December | Aaron Maashoh | 24_ | 821-00-0265 | Scientist | 19114.12 | NaN | 3 | 4 | 3 | 4 | Auto Loan, Credit-Builder Loan, Personal Loan, and Home Equity Loan | 4 | 5 | 11.27 | 4.0 | Good | 809.98 | 32.430559 | 23 Years and 0 Months | No | 49.574949 | 39.08251089460281 | High_spent_Medium_value_payments | 343.82687322383634 | NaN |
| 4 | 0x1616 | CUS_0x21b1 | September | Rick Rothackerj | 28 | 004-07-5839 | _______ | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | 1 | 5.42 | 5.0 | Good | 605.03 | 25.926822 | 27 Years and 3 Months | No | 18.816215 | 39.684018417945296 | High_spent_Large_value_payments | 485.2984336755923 | NaN |
| 5 | 0x1617 | CUS_0x21b1 | October | Rick Rothackerj | 28 | #F%$D@*&8 | Teacher | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | 3 | 5.42 | 5.0 | Good | 605.03 | 30.116600 | 27 Years and 4 Months | No | 18.816215 | 251.62736875017606 | Low_spent_Large_value_payments | 303.3550833433617 | NaN |
| 6 | 0x1618 | CUS_0x21b1 | November | Rick Rothackerj | 28 | 004-07-5839 | Teacher | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | NaN | 5.42 | 5.0 | _ | 605.03 | 30.996424 | 27 Years and 5 Months | No | 18.816215 | 72.68014533363515 | High_spent_Large_value_payments | 452.30230675990265 | NaN |
| 7 | 0x1619 | CUS_0x21b1 | December | Rick Rothackerj | 28 | 004-07-5839 | Teacher | 34847.84 | 3037.986667 | 2 | 4 | 6 | 1 | Credit-Builder Loan | 3 | 2_ | 7.42 | 5.0 | _ | 605.03 | 33.875167 | 27 Years and 6 Months | No | 18.816215 | 153.53448761392985 | !@9#%8 | 421.44796447960783 | NaN |
| 8 | 0x1622 | CUS_0x2dbc | September | Langep | 35 | 486-85-3974 | Engineer | 143162.64 | NaN | 1 | 5 | 8 | 3 | Auto Loan, Auto Loan, and Not Specified | 8 | 1942 | 7.1 | 3.0 | Good | 1303.01 | 35.229707 | 18 Years and 5 Months | No | 246.992319 | 397.50365354404653 | Low_spent_Medium_value_payments | 854.2260270022115 | NaN |
| 9 | 0x1623 | CUS_0x2dbc | October | Langep | 35 | 486-85-3974 | Engineer | 143162.64 | 12187.220000 | 1 | 5 | 8 | 3 | Auto Loan, Auto Loan, and Not Specified | 6 | 3 | 2.1 | 3.0 | Good | 1303.01 | 35.685836 | 18 Years and 6 Months | No | 246.992319 | 453.6151305781054 | Low_spent_Large_value_payments | 788.1145499681528 | NaN |
| ID | Customer_ID | Month | Name | Age | SSN | Occupation | Annual_Income | Monthly_Inhand_Salary | Num_Bank_Accounts | Num_Credit_Card | Interest_Rate | Num_of_Loan | Type_of_Loan | Delay_from_due_date | Num_of_Delayed_Payment | Changed_Credit_Limit | Num_Credit_Inquiries | Credit_Mix | Outstanding_Debt | Credit_Utilization_Ratio | Credit_History_Age | Payment_of_Min_Amount | Total_EMI_per_month | Amount_invested_monthly | Payment_Behaviour | Monthly_Balance | Credit_Score | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 149990 | 0x25fe0 | CUS_0x8600 | July | Sarah McBridec | 28 | 031-35-0942 | Architect | 20002.88 | 1929.906667 | 10 | 8 | 29 | 5 | Personal Loan, Auto Loan, Mortgage Loan, Student Loan, and Student Loan | 33 | 26 | 18.31 | 9.0 | Bad | 3571.7 | 25.123535 | NaN | Yes | 60.964772 | 173.2755025599617 | Low_spent_Large_value_payments | 228.750392 | Standard |
| 149991 | 0x25fe1 | CUS_0x8600 | August | Sarah McBridec | 29 | 031-35-0942 | Architect | 20002.88 | 1929.906667 | 10 | 8 | 29 | 5 | Personal Loan, Auto Loan, Mortgage Loan, Student Loan, and Student Loan | 33 | 25 | 18.31 | 9.0 | Bad | 3571.7 | 37.140784 | 6 Years and 3 Months | Yes | 60.964772 | 34.66290609052614 | High_spent_Large_value_payments | 337.362988 | Standard |
| 149992 | 0x25fe6 | CUS_0x942c | January | Nicks | 24 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 23 | NaN | 9.5 | 3.0 | _ | 502.38 | 32.991333 | 31 Years and 3 Months | No | 35.104023 | 401.1964806036356 | Low_spent_Small_value_payments | 189.64108 | Poor |
| 149993 | 0x25fe7 | CUS_0x942c | February | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99_ | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 23 | NaN | 11.5 | 3.0 | Good | 502.38 | 29.135447 | 31 Years and 4 Months | No | 58638.000000 | 180.7330951944497 | Low_spent_Medium_value_payments | 400.104466 | Standard |
| 149994 | 0x25fe8 | CUS_0x942c | March | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 20 | 6 | 9.5 | 3.0 | _ | 502.38 | 39.323569 | 31 Years and 5 Months | No | 35.104023 | 140.58140274528395 | High_spent_Medium_value_payments | 410.256158 | Poor |
| 149995 | 0x25fe9 | CUS_0x942c | April | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 23 | 7 | 11.5 | 3.0 | _ | 502.38 | 34.663572 | 31 Years and 6 Months | No | 35.104023 | 60.97133255718485 | High_spent_Large_value_payments | 479.866228 | Poor |
| 149996 | 0x25fea | CUS_0x942c | May | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 18 | 7 | 11.5 | 3.0 | _ | 502.38 | 40.565631 | 31 Years and 7 Months | No | 35.104023 | 54.18595028760385 | High_spent_Medium_value_payments | 496.65161 | Poor |
| 149997 | 0x25feb | CUS_0x942c | June | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 5729 | 2 | Auto Loan, and Student Loan | 27 | 6 | 11.5 | 3.0 | Good | 502.38 | 41.255522 | 31 Years and 8 Months | No | 35.104023 | 24.02847744864441 | High_spent_Large_value_payments | 516.809083 | Poor |
| 149998 | 0x25fec | CUS_0x942c | July | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99 | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 20 | NaN | 11.5 | 3.0 | Good | 502.38 | 33.638208 | 31 Years and 9 Months | No | 35.104023 | 251.67258219721603 | Low_spent_Large_value_payments | 319.164979 | Standard |
| 149999 | 0x25fed | CUS_0x942c | August | Nicks | 25 | 078-73-5990 | Mechanic | 39628.99_ | 3359.415833 | 4 | 6 | 7 | 2 | Auto Loan, and Student Loan | 18 | 6 | 11.5 | 3.0 | Good | 502.38 | 34.192463 | 31 Years and 10 Months | No | 35.104023 | 167.1638651610451 | !@9#%8 | 393.673696 | Poor |